PRODUCTION OF PROTEINS
USING HOMOLOGOUS RECOMBINATION
INTRODUCTION 

Technical Field 

The field of this invention is the
expression of mammalian proteins.

Background 

The discoveries of restriction enzymes,
cloning, sequencing, reverse transcriptase, and
monoclonal antibodies have resulted in extraordinary
capabilities in isolating, identifying, and
manipulating nucleic acid sequences. As a result of
these capabilities, numerous genes and their
transcriptional control elements have been identified
and manipulated. The genes have been used for
producing large amounts of a desired protein in
heterologous hosts (bacterial and eukaryotic host cell
systems).

In many cases, the process of obtaining 
coding sequences and eliciting their expression has 
been a long and arduous one. The identification of 
the coding sequence, either cDNA or genomic DNA, has 
frequently involved the construction of libraries, 
identification of fragments of the open reading frame, 
examining the flanking sequences, and the like. In 
mammalian genes, where introns are frequently
encountered, the coding region has in many instances
been only a small fraction of the total nucleic acid
associated with the gene. In other cases, pseudogenes
or multi-membered gene families have obscured the
ability to isolate a particular gene of interest.
Nevertheless, as techniques have improved, there has
been a continuous parade of successful identifications
and isolations of genes of interest.

In many situations one is primarily
interested in a source of the protein product. The
cell type in the body which produces the protein is
frequently an inadequate source, since the protein may
be produced in low amounts, the protein may only be
produced in a differentiated host cell which is grown
in culture only with difficulty, or the host cell,
particularly a human cell, is not economic or
efficient in a culture process for production of the
product. There is, therefore, significant interest in
developing alternative techniques for producing
proteins of interest in culture with cells which
provide for economic and efficient production of the
desired protein and, when possible, appropriate
processing of the protein product.

Relevant Literature 

Mansour et al., Nature, 336:348-352 (1988),
describe a general strategy for targeting mutations to
non-selectable genes. Weidle et al., Gene, 66:193-203
(1988), describe amplification of tissue-type
plasminogen activator with a DHFR gene and loss of
amplification in the absence of selective pressure.
Murnane and Yezzi, Somatic Cell and Molecular
Genetics, 14:273-286 (1988), describe transformation
of a human cell line with an integrated selectable
gene marker lacking a transcriptional promoter, with
tandem duplication and amplification of the gene
marker. Thomas and Capecchi, Cell, 51:503-512
(1987), describe site-directed mutagenesis by gene
targeting in mouse embryo-derived stem cells. Song
et al., Proc. Natl. Acad. Sci. USA, 84:6820-6824
(1987), describe homologous recombination in human
cells by a two-staged integration. Liskay et al.,
"Homologous Recombination Between Repeated Chromosomal
Sequences in Mouse Cells," Cold Spring Harbor Symp.
Quant. Biol., 49:183-189 (1984), describe integration
of two different mutations of the same gene and
homologous recombination between the mutant genes.
Rubnitz and Subramani, Mol. Cell. Biol., 4:2253-2258
(1984), describe the minimum amount of homology
required for homologous recombination in mammalian
cells. Kim and Smithies, Nucl. Acids Res., 16:8887-8903
(1988), describe an assay for homologous
recombination using the polymerase chain reaction.

SUMMARY OF THE INVENTION 
Expression of mammalian proteins of interest 
is achieved by employing homologous recombination for 
integration of an amplifiable gene and other 
regulatory sequences in proximity to a gene of 
interest without interruption of the production of a 
proper transcript. The region comprising the 
amplifiable gene and the gene of interest may be 
amplified, the genome fragmented and directly or 
indirectly transferred to an expression host for 
expression of the target protein. If not previously 
amplified, the target region is then amplified, and 
the cell population screened for cells producing the 
target protein. Cells which produce the target 
protein at high and stable levels are expanded and
used for expression of the target protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a diagrammatic illustration of the
plasmid pCG.1 showing the sequence of the modified
polylinker;

FIG. 2 is a diagrammatic illustration of the
construction of the plasmid pCG.HR1;

FIG. 3 is a diagrammatic illustration of the
result of targeting the EPO locus by homologous
recombination with the DNA from pCG.HR1 cut with NotI;

FIG. 4 is a diagrammatic illustration of the PCR
amplification fragment produced from cells in which a
homologous recombination event has occurred.

DESCRIPTION OF SPECIFIC EMBODIMENTS

Methods and compositions are provided for
production of mammalian proteins of interest in
culture. The method employs homologous recombination
in a host cell for integrating an amplifiable gene in
the vicinity of a target gene, which target gene
encodes the protein of interest. The region
comprising both the amplifiable gene and target gene
will be referred to as the amplifiable region. The
resulting transformed primary cells may now be
subjected to conditions which select for
amplification, or the amplification may be performed
subsequently. "Transform" includes transformation,
transfection, transduction, conjugation, fusion,
electroporation, or any other technique for introducing
DNA into a viable cell. The chromosomes or DNA of the
transformed cells are then used to transfer the
amplifiable region into the genome of secondary
expression host cells, where the target region, if
not previously amplified sufficiently or at all, is
further amplified. The resulting cell lines are
screened for production of the target protein, and
secondary cell lines are selected for desired levels of
production, which cells may be expanded and used for
production of the desired protein in culture.

The primary cell may be any mammalian cell
of interest, particularly mammalian cells which do not
grow readily in culture, more particularly primate
cells, especially human cells, where the human cells
may be normal cells, including embryonic or neoplastic
cells, particularly normal cells. Various cell types
may be employed as the primary cells, including
fibroblasts, particularly diploid skin fibroblasts,
lymphocytes, epithelial cells, neurons, endothelial



WO 91/06667
PCT/US90/06436

cells, or other somatic cells, or germ cells. Of
particular interest are skin fibroblasts, which can be
readily propagated to provide for large numbers of
normal cells, embryonic kidney cells, and the like.
These cells may or may not be expressing the gene of
interest. In those instances where the target gene is
inducible or only expressed in certain differentiated
cells, one may select cells in which the target gene
is expressed, which may require immortalized cells
capable of growth in culture.

A number of amplifiable genes exist, where by
appropriate use of a selection agent, a gene
integrated in the genome will be amplified with
adjacent flanking DNA. Amplifiable genes include
dihydrofolate reductase, metallothionein-I and -II,
preferably primate metallothionein genes, adenosine
deaminase, ornithine decarboxylase, etc. The
amplifiable gene will have transcriptional signals
which are functional in the secondary or expression
host and desirably be functional in the primary host,
particularly where amplification is employed in the
primary host or the amplifiable gene is used as a
marker.

The target gene may be any gene of
interest, there already having been a large number of
proteins of interest identified and isolated with
continual additions to the list. Proteins of
interest include cytokines, such as interleukins 1-10;
growth factors such as EGF, FGF, PDGF, and TGF;
somatotropins; growth hormones; colony stimulating
factors, such as G-, M-, and GM-CSF; erythropoietin;
plasminogen activators, such as tissue and urine;
enzymes, such as superoxide dismutase; interferons;
T-cell receptors; surface membrane proteins; insulin;
lipoproteins; α1-antitrypsin; CD proteins, such as
CD3, 4, 8, 19; clotting factors, e.g., Factor VIIIc
and von Willebrand factor; anticlotting factors, such
as Protein C; atrial natriuretic factor; tumor necrosis
factor; transport proteins; homing receptors;
addressins; regulatory proteins; etc.

For homologous recombination, constructs
will be prepared where the amplifiable gene will be
flanked on one or both sides with DNA homologous with
the DNA of the target region. The homologous DNA will
generally be within 100 kb, usually 50 kb, preferably
about 25 kb, of the transcribed region of the target
gene, more preferably within 2 kb of the target gene.
By gene is intended the coding region and those
sequences required for transcription of a mature
mRNA. The homologous DNA may include the 5'-upstream
region comprising any enhancer sequences,
transcriptional initiation sequences, the region 5' of
these sequences, or the like. The homologous region
may include a portion of the coding region, where the
coding region may be comprised only of an open reading
frame or a combination of exons and introns. The
homologous region may comprise all or a portion of an
intron, where all or a portion of one or more exons
may also be present. Alternatively, the homologous
region may comprise the 3'-region, so as to comprise
all or a portion of the transcription termination
region, or the region 3' of this region. The
homologous regions may extend over all or a portion of
the target gene or be outside the target gene,
comprising all or a portion of the transcriptional
regulatory regions and/or the structural gene. For
the most part, the homologous sequence will be joined
to the amplifiable gene, proximally or distally.
Usually a sequence other than the wild-type sequence
normally associated with the target gene will be used
to separate the homologous sequence from the
amplifiable gene on at least one side of the
amplifiable gene. Some portion of the sequence may be
the 5' or 3' sequence associated with the amplifiable
gene, as a result of the manipulations
associated with the amplifiable gene.



The homologous regions flanking the
amplifiable gene need not be identical to the target
region, where in vitro mutagenesis is desired. For
example, one may wish to change the transcriptional
initiation region for the target gene, so that a
portion of the homologous region might comprise
nucleotides different from the wild-type 5' region of
the target gene. Alternatively, one could provide for
insertion of a transcriptional initiation region
different from the wild-type initiation region
between the wild-type initiation region and the
structural gene. Similarly, one might wish to
introduce various mutations into the structural gene,
so that the homologous region would comprise
mismatches, resulting in a change in the encoded
protein. For example, a signal leader sequence could
be introduced in proper reading frame with the target
gene to provide for secretion of the target protein
expression product. Alternatively, one might change
the 3' region, e.g., untranslated region,
polyadenylation site, etc. of the target gene.
Therefore, by homologous recombination, one can
provide for maintaining the integrity of the target
gene, so as to express the wild-type protein under
the transcriptional regulation of the wild-type
promoter, or one may provide for a change in
transcriptional regulation, processing or sequence of
the target gene. In some instances, one may wish to
introduce an enhancer in relation to the
transcriptional initiation region, which can be
provided by, for example, integration of the
amplifiable gene associated with the enhancer in a
region upstream from the transcriptional initiation
regulatory region or in an intron or even downstream
from the target gene.

In order to prepare the subject constructs,
it will be necessary to know the sequence which is
targeted for homologous recombination. While it is
reported that a sequence of 14 bases complementary to
a sequence in a genome may provide for homologous
recombination, normally the individual flanking
sequences will be at least about 150 bp, and may be
12 kb or more, usually not more than about 8 kb. The
size of the flanking regions will be determined by the
size of the known sequence, the number of sequences in
the genome which may have homology to the site for
integration, whether mutagenesis is involved and the
extent of separation of the regions for mutagenesis,
the particular site for integration, or the like.

The integrating constructs may be prepared
in accordance with conventional ways, where sequences
may be synthesized, isolated from natural sources,
manipulated, cloned, ligated, subjected to in vitro
mutagenesis, primer repair, or the like. At various
stages, the joined sequences may be cloned, and
analyzed by restriction analysis, sequencing, or the
like. Usually the construct will be carried on a
cloning vector comprising a replication system
functional in a prokaryotic host, e.g., E. coli, and a
marker for selection, e.g., biocide resistance,
complementation to an auxotrophic host, etc. Other
functional sequences may also be present, such as
polylinkers, for ease of introduction and excision of
the construct or portions thereof, or the like. A
large number of cloning vectors are available such as
pBR322, the pUC series, etc.

Once the construct is prepared, it may then
be used for homologous recombination in the primary
cell target. Various techniques may be employed for
integrating the construct into the genome of the
primary cell without being joined to a replication
system functional in the primary host. See for
example, U.S. Patent No. 4,319,216, as well as the
references cited in the Relevant Literature section.
Alternatively, the construct may be inserted into an
appropriate vector, usually having a viral replication
system, such as SV40, bovine papilloma virus,
adenovirus, or the like. The linear DNA sequence
vector may also have a selectable marker for
identifying transfected cells. Selectable markers
include the neo gene, allowing for selection with
G418, the herpes tk gene for selection with HAT
medium, the gpt gene with mycophenolic acid,
complementation of an auxotrophic host, etc.

The vector may or may not be capable of
stable maintenance in the host. Where the vector is
capable of stable maintenance, the cells will be
screened for homologous integration of the vector into
the genome of the host, where various techniques for
curing the cells may be employed. Where the vector is
not capable of stable maintenance, for example, where
a temperature sensitive replication system is
employed, one may change the temperature from the
permissive temperature to the non-permissive
temperature, so that the cells may be cured of the
vector. In this case, only those cells having
integration of the construct comprising the
amplifiable gene and, when present, the selectable
marker, will be able to survive selection.

Where a selectable marker is present, one
may select for the presence of the construct by means
of the selectable marker. Where the selectable marker
is not present, one may select for the presence of the
construct by the amplifiable gene. For the neo gene
or the herpes tk gene, one could employ a medium for
growth of the transformants containing about 0.1-1 mg/ml
of G418 or HAT medium, respectively. Where DHFR is the
amplifiable gene, the selective medium may include
from about 0.01-0.25 µM of methotrexate.

In carrying out the homologous
recombination, the DNA will be introduced into the
primary cells. Techniques which may be used include
calcium phosphate/DNA co-precipitates, microinjection
of DNA into the nucleus, electroporation, bacterial
protoplast fusion with intact cells, transfection,
polycations, e.g., polybrene, polyornithine, etc., or
the like. The DNA may be single or double stranded
DNA, linear or circular. For various techniques for
transforming mammalian cells, see Keown et al.,
Methods in Enzymology (1989), Keown et al., Methods
in Enzymology (1990) Vol. 185, pp. 527-537 and
Mansour et al., Nature, 336:348-352 (1988).

Upstream and/or downstream from the target
region construct may be a gene which provides for
identification of whether a double crossover has
occurred. For this purpose, the herpes simplex virus
thymidine kinase gene may be employed since the
presence of the thymidine kinase gene may be detected
by the use of nucleoside analogs, such as acyclovir or
gancyclovir, for their cytotoxic effects on cells that
contain a functional HSV-tk gene. The absence of
sensitivity to these nucleoside analogs indicates the
absence of the thymidine kinase and, therefore, where
homologous recombination has occurred, that a double
crossover event has also occurred.

The presence of the marker gene, as evidenced
by resistance to a biocide or growth in a medium which
selects for the presence of the marker gene,
establishes the presence and integration of the target
construct into the host genome. No further selection
need be made at this time, since the selection will be
made in the secondary expression host, where
expression of the amplified target gene may be
detected. If one wishes, one can determine whether
homologous recombination has occurred by employing PCR
and sequencing the resulting amplified DNA sequences.
If desired, amplification may be performed at this
time by stressing the primary cells with the
appropriate amplifying reagent, so that multi-copies
of the target gene are obtained. Alternatively,
amplification may await transfer to the secondary cell
expression host.

High molecular weight DNA, greater than
about 20 kb, preferably greater than about 50 kb, or
preferably metaphase chromosomes, are prepared from the
primary recipient cell strain having the appropriate
integration of the amplification vector. Preparation
and isolation techniques are described by Nelson and
Housman, in Gene Transfer (ed. R. Kucherlapati), Plenum
Press, 1986. The DNA may then be introduced in the
same manner as described above into the secondary host
expression cells, using the same techniques as, or
different techniques from, those employed for the
primary cells. Various mammalian expression hosts are
available and may be employed. These hosts include CHO
cells, monkey kidney cells, C127 mouse fibroblasts, 3T3
mouse cells, Vero cells, etc. Desirably the hosts will
have a negative background for the amplifiable gene or a
gene which is substantially less responsive to the
amplifying agent.

The transformed cells are grown in selective
medium containing about 0.01-0.5 µM methotrexate and,
where another marker is present, e.g., the neo gene,
the medium may contain from about 0.1-1 mg/ml G418.
The resistant colonies are isolated and may then be
analyzed for the presence of the construct in
juxtaposition to the target gene. This may be as a
result of detection of expression of the target gene
product, where there will normally be a negative
background for the target gene product, use of PCR,
Southern hybridization, or the like.

The cells containing the construct are then
expanded and subjected to selection and amplification
with media containing progressively higher
concentrations of the amplifying reagent, for
example, 0.5-200 µM of methotrexate for the DHFR gene,
and may be analyzed at each selection step for
production of the target product. Expansion will
include at least duplication and may result in at
least 5 copies, preferably 10 copies or more in a
tandem relationship. Thus protein production will be
increased at least 1.5 fold from expression from a
single copy, usually at least 3 fold, preferably at
least 5 fold.

The various clones may then be screened for
optimum stable production of the target product and
these clones may then be expanded and used
commercially for production in culture. In this
manner, high yields of a product may be obtained,
without the necessity of isolating the message and
doing the various manipulations associated with
genetic engineering or isolating the genomic gene,
where very large genes can be a major research and
development effort.

The following examples are offered by way of 
illustration and not by way of limitation. 

EXPERIMENTAL 

Cells 

Normal human diploid skin fibroblasts
("primary recipient") are propagated in EEMEM medium
supplemented with 20% fetal calf serum. Dihydrofolate
reductase (DHFR) deficient Chinese hamster ovary (CHO)
DUKX-B11 cells (Urlaub and Chasin, Proc. Natl. Acad.
Sci. USA 77:4216-4220 (1980)) ("secondary recipient")
are propagated in alpha-medium supplemented with 10%
dialyzed fetal bovine serum.

DNA Vector 

The amplification vector is constructed
from pUC19 (Yanisch-Perron et al., Gene 33:103-119
(1985)). A 1.8 kb HaeII fragment containing a
hygromycin B phosphotransferase gene (hph) driven by
the herpes simplex virus thymidine kinase (HSV tk)
promoter is isolated from pHyg (Sugden et al., Mol.
Cell. Biol. 5:410-413 (1985)) by digestion with HaeII
and gel electrophoresis. Synthetic adaptors are added
onto this fragment to convert the HaeII ends into
HindIII ends, and the resulting fragment is joined to
pUC19 digested with HindIII. The resulting plasmid
pUCH contains the hygromycin cassette such that
hph and beta-lactamase are transcribed in
opposite orientations. A 1.3 kb SalI fragment
containing a DHFR gene driven by SV40 transcriptional
signals is isolated from pTND (Connors et al., DNA
7:651-661 (1988)) by digestion with SalI and gel
electrophoresis. This fragment is ligated to pUCH
digested with SalI. The resulting plasmid pUCD
contains the DHFR cassette such that DHFR and [...] are
transcribed in the same direction. A 1.76 kb BamHI
fragment from the phage F15 (Friezner Degen et al., J.
Biol. Chem. 261:6972-6985 (1986)), which contains 1.45
kb of DNA flanking the transcriptional start of human
tissue plasminogen activator (t-PA) in addition to the
first exon and part of the first intron, is isolated by
gel electrophoresis after BamHI digestion. This
fragment is joined to pUCD following digestion of the
latter with BamHI. The resulting plasmid pUCG has
the promoter of the t-PA fragment oriented opposite to
that of the DHFR cassette. The t-PA fragment contains
a single NcoI site, which is not unique to pUCG. A
partial NcoI digest is carried out and a NotI linker
is inserted. The resulting plasmid pCG contains a
unique NotI site in the t-PA fragment which allows the
plasmid to be linearized prior to transformation of
the primary human diploid fibroblasts in order to
increase the frequency of homologous recombination
(Kucherlapati et al., Proc. Natl. Acad. Sci. USA
81:3153-3157 (1984)).



Preparation of Primary Recipients 

The plasmid pCG linearized with NotI is
introduced into the primary recipients by
electroporation, employing DNA at 10 nM. The resulting
cells are then grown in selective medium (EEMEM with
200 µg/ml hygromycin B). Resistant colonies are
isolated and analyzed by PCR (Kim and Smithies, 
Nucleic Acids Res. 16:8887-8903 (1988)) using as 
primers the sequences GCGGCCTCGGCCTCTGCATA and 
CATCTCCCCTCTGGAGTGGA to distinguish homologous 
integrants from random ones. Amplification of 
cellular DNA by PCR using these two primers yields a 
fragment of 1.9 kb only when DNA from correctly 
targeted cells is present. Cells comprising the DHFR 
gene integrated into the t-PA region are expanded and 
used as a source of genetic material for preparation 
of secondary recipients. 
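
As a rough illustration of how the two screening primers quoted above can be sanity-checked before use, the sketch below computes their length, GC content, and a rule-of-thumb melting temperature. The function names and the use of the Wallace rule (2 °C per A/T, 4 °C per G/C) are assumptions made for this example; the source does not state how the primers were designed or evaluated.

```python
# Minimal sketch: basic properties of the two t-PA screening primers.
# The Wallace rule is only a rough estimate and is an assumption here,
# not a method described in the source.

TPA_PRIMERS = {
    "primer_1": "GCGGCCTCGGCCTCTGCATA",
    "primer_2": "CATCTCCCCTCTGGAGTGGA",
}


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def summarize(primers):
    """Return length, GC fraction and Wallace Tm for each named primer."""
    return {
        name: {
            "length": len(seq),
            "gc": gc_fraction(seq),
            "tm_wallace": wallace_tm(seq),
        }
        for name, seq in primers.items()
    }


if __name__ == "__main__":
    for name, props in summarize(TPA_PRIMERS).items():
        print(name, props)
```

Both primers are 20-mers with fairly high GC content, so the two estimated melting temperatures come out within a few degrees of each other.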



Preparation of Secondary Recipients

Metaphase chromosomes are prepared (Nelson et
al., J. Mol. Appl. Genet. 2:563-577 (1984)) from
recipients demonstrating homologous recombination with
the DHFR gene and are then transformed into DHFR-deficient
CHO cells by calcium phosphate mediated gene transfer
(Nelson et al., J. Mol. Appl. Genet. 2:563-577
(1984)). The cells are then grown in selective medium
(alpha-medium containing 200 µg/ml hygromycin B).
Resistant colonies are isolated and analyzed for
expression of human t-PA (Kaufman et al., Mol. Cell.
Biol. 5:1750-1759 (1985)). The cell clones are then
grown in selective medium containing progressively
higher concentrations of methotrexate (0.02-80 µM, with
steps of 4-fold increases in concentration). After
this amplification procedure, the cells are harvested
and the human t-PA is analyzed employing an ELISA
assay with a monoclonal antibody specific for t-PA
(Weidle and Buckel, Gene 51:31-41 (1987)). Clones
providing for high levels of expression of t-PA are
stored for subsequent use.
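
The stepwise methotrexate selection described above (0.02-80 µM in 4-fold steps) can be written out as an explicit concentration series. The sketch below is illustrative only: the function name is hypothetical, concentrations are held as integer nanomolar values to avoid floating-point drift, and finishing the series at the stated upper bound (rather than at the last exact 4-fold multiple) is an assumption, since the source gives only the range and the fold increase.

```python
# Minimal sketch: enumerate a stepwise selection series in nanomolar units.
# Ending at the stated upper bound is an assumption for illustration.


def stepwise_schedule(start_nm, stop_nm, fold):
    """Return concentrations from start_nm upward in `fold` steps, capped at stop_nm."""
    if start_nm <= 0 or stop_nm < start_nm or fold < 2:
        raise ValueError("need 0 < start_nm <= stop_nm and fold >= 2")
    steps = []
    conc = start_nm
    while conc < stop_nm:
        steps.append(conc)
        conc *= fold
    steps.append(stop_nm)  # assumed final step at the upper bound
    return steps


if __name__ == "__main__":
    # 0.02 uM = 20 nM, 80 uM = 80,000 nM
    for nm in stepwise_schedule(20, 80_000, 4):
        print(f"{nm / 1000:g} uM")
```

Under these assumptions the series has seven steps; the last exact 4-fold multiple (20.48 µM) is followed by the 80 µM ceiling.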

Isolation of a Genomic Clone Containing Sequences for
Targeting Erythropoietin

A clone was obtained by screening a human
placental DNA genomic library (Clontech) in EMBL3
SP6/T7 using two 36 bp oligonucleotide probes,
5'-CTGGGTTGCTGAGTTCCGCAAAGTAGCTGGGTCTGG-3' and
5'-CGGGGGTCGGGGCTGTTATCTGCATGTGTGCGTGCG-3', to the
presumed promoter region of human erythropoietin.
From this clone two subclones were created in pSP72
(Krieg and Melton (1987) Meth. Enzymol. 155, 397-415),
one containing a 5 kb BamHI-HindIII fragment from the
region upstream of the coding region of EPO (pTD.1)
and one containing a 5 kb HindIII-BamHI fragment
coding for EPO (pTD.2).

Construction of DNA Fragment for 
Targeting Erythropoietin 

A plasmid pCG.1 was constructed by
replacement of the polylinker of pBluescript SK(-)
(Stratagene) between the SacI and KpnI sites with a
synthetic double stranded 72 base pair DNA fragment
(FIG. 1). Referring to FIG. 2, into pCG.1 was cloned
between the HindIII and XbaI sites a 678 bp fragment
containing the enhancer and promoter of the immediate
early gene of human cytomegalovirus (CMV, Boshart et
al. (1985) Cell 41, 521-530) obtained by a PCR
amplification of the plasmid pUCH.CMV (gift of
M. Calos, Stanford U.) using the oligonucleotide
primers 5'-CGCCAAGCTTGGCCATTGCATACGTT-3' and
5'-GAGGTCTAGACGGTTCACTAAACGAGCTCT-3' in order to engineer
HindIII and XbaI sites respectively onto the ends of
the resultant fragment. The resultant plasmid pCG.CMV
was used for further constructions.
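
The statement that the two CMV primers carry engineered HindIII and XbaI sites can be checked directly against the quoted sequences. The sketch below scans each primer for the standard recognition sequences (HindIII, AAGCTT; XbaI, TCTAGA); the helper name and data layout are hypothetical and chosen only for this illustration.

```python
# Minimal sketch: locate engineered restriction sites in the CMV PCR primers.
# Recognition sequences are the standard ones for HindIII and XbaI.

SITES = {"HindIII": "AAGCTT", "XbaI": "TCTAGA"}

CMV_PRIMERS = {
    "forward": "CGCCAAGCTTGGCCATTGCATACGTT",
    "reverse": "GAGGTCTAGACGGTTCACTAAACGAGCTCT",
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for every site present in seq."""
    seq = seq.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = [
            i
            for i in range(len(seq) - len(motif) + 1)
            if seq[i:i + len(motif)] == motif
        ]
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in CMV_PRIMERS.items():
        print(name, find_sites(seq))
```

Each primer contains exactly one of the two sites, beginning four bases from its 5' end, which leaves a short 5' overhang of extra bases ahead of the site. Because both recognition sequences are palindromic, scanning one strand is sufficient.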

The 620 bp BstEII-XbaI fragment from
pTD.2 was joined by the use of a BstEII-XbaI adapter
to pCG.CMV restricted with XbaI to create the plasmid
pCG.CMV/EPO, in which the BstEII site of the EPO
fragment is next to the promoter end of the CMV
fragment. Into pCG.CMV/EPO was cloned successively a
1.94 kb fragment encoding methotrexate resistance from
the plasmid pSV2dhfr (Subramani et al. (1981) Mol.
Cell. Biol. 1, 854-864) and a 1.15 kb fragment
encoding G418 resistance from the plasmid pMC1neo
polyA (Thomas and Capecchi (1987) Cell 51, 503-512).
The neo gene was obtained as an XhoI-SalI fragment and
the dhfr gene was obtained by PCR amplification using
the primers 5'-GGACGCGTGGATCCAGACATGATAAGATA-3' and
5'-GGACGCGTCAGCTGTGGAATGTGTGTCAG-3' designed to add MluI
sites at the ends of the resultant fragment. The neo
and dhfr genes were cloned into the XhoI and MluI
sites respectively of pCG.CMV/EPO to give the plasmids
pCG.CMV/EPO/DHFR and pCG.CMV/EPO/Neo/DHFR such that
their transcription is in the same orientation as that
of CMV. Finally, the 5 kb BamHI-HindIII fragment from
pTD.1 was added via ClaI adapters at the ClaI site of
pCG.CMV/EPO/Neo/DHFR to give pCG.HR1. In pCG.HR1, the
5' 5 kb EPO fragment is in the same orientation as that
of the 620 bp BstEII-XbaI fragment with respect to the
original lambda clone.

A 9.54 kb fragment containing the 5' 5 kb
BamHI-HindIII EPO fragment, the dhfr and G418 markers,
the CMV enhancer/promoter and the 620 bp BstEII-XbaI
EPO fragment can be released from pCG.HR1 as a NotI or
SacII fragment. This NotI fragment can be used for
homologous recombination as it is designed to serve as
an omega structure in recombination having 5 kb and
620 bp of homology to facilitate the event (FIG. 3).
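
The stated 9.54 kb size can be cross-checked against the component sizes given in the construction steps. The sketch below simply sums those figures; the dictionary labels are descriptive names chosen for this example, and attributing the small remainder to adapter, linker and polylinker sequence (and to rounding of the "5 kb" arm) is an assumption, not something the source states.

```python
# Minimal sketch: account for the 9.54 kb targeting fragment from the
# component sizes quoted in the text. Interpretation of the remainder
# is an assumption.

COMPONENTS_BP = {
    "EPO 5' BamHI-HindIII homology arm": 5000,
    "dhfr (methotrexate resistance)": 1940,
    "neo (G418 resistance)": 1150,
    "CMV enhancer/promoter": 678,
    "EPO BstEII-XbaI homology arm": 620,
}
STATED_FRAGMENT_BP = 9540


def accounted_bp(components):
    """Total length explained by the listed components."""
    return sum(components.values())


def non_homologous_bp(components):
    """Length of the listed components that are not EPO homology arms."""
    return sum(bp for name, bp in components.items() if "homology arm" not in name)


if __name__ == "__main__":
    total = accounted_bp(COMPONENTS_BP)
    print("listed components:", total, "bp")
    print("non-homologous insert:", non_homologous_bp(COMPONENTS_BP), "bp")
    print("unaccounted:", STATED_FRAGMENT_BP - total, "bp")
```

The listed pieces account for all but about 150 bp of the stated length, and the marker-plus-promoter portion comes to roughly 3.8 kb.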

For electroporation, the DNA was first cut
with NotI, then extracted with phenol/chloroform and
precipitated by the addition of ethanol before
centrifugation. The resultant DNA pellet was
resuspended at a concentration of 2 mg/ml in a volume
(10 µl) of 10 mM Tris-HCl, 1 mM EDTA (TE).

Introduction of DNA into Cells

Transformed primary human 293 embryonal
kidney cells (ATCC CRL 1573) were cultured in Cellgro
DMEM H16 (Mediatech) supplemented with 10% calf serum,
glutamine (2 mM) and penicillin
(100 U/ml)/streptomycin (0.1 mg/ml) and grown at 37°C
in 5% CO2. At 90% confluency, cells were prepared for
electroporation by trypsinization, concentration by
brief centrifugation and resuspension in PBS at 10^[...]
cells/0.8 ml. The cells were equilibrated at 4°C, and
DNA (50 µg) restricted with NotI (as described above)
was added. The mixture was electroporated at 960 µF
and 260 V with a BioRad Gene Pulser and then iced
again for 10 min before plating onto a 10 cm dish.
After incubation at 37°C for 48 hr, the cells from a
10 cm dish were split equally among five 24-well plates
in media containing G418 at 0.6 mg/ml (effective
concentration). Under these electroporation
conditions, 4-10 colonies/well survive drug selection
after 2 weeks.

Detection of Homologous Recombination by PCR Analysis

Using NotI restricted DNA from pCG.HR1,
successful homologous recombination results in
insertion of the 3.8 kb construct at the targeted EPO
locus with simultaneous deletion of 1.2 kb of genomic
sequence (FIG. 3). PCR is used to distinguish unique
targeting events from random integration of the DNA,
as diagrammed in FIG. 4. Two primers are synthesized,
one to the 3' end of CMV and the other to the region
3' to the XbaI site used for the 620 bp BstEII-XbaI
fragment in the targeting DNA. A homologous
recombination event generates a DNA target in the
genome from which these primers produce an
amplification product of 860 bp.
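
One way to see that the CMV-side detection primer sits at the 3' end of the CMV fragment is to compare it with the reverse primer used to clone that fragment. The sketch below uses the CMV cloning reverse primer and the 24-mer detection primer as quoted elsewhere in this document; the helper names are hypothetical, and reading the overlap as confirmation of primer placement is an inference made here rather than a statement from the source.

```python
# Minimal sketch: relate the CMV-side detection primer to the reverse primer
# that defined the 3' end of the cloned CMV fragment.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

CMV_CLONING_REVERSE = "GAGGTCTAGACGGTTCACTAAACGAGCTCT"  # carries the XbaI site
DETECTION_CMV_PRIMER = "AAGCAGAGCTCGTTTAGTGAACCG"
XBAI_SITE = "TCTAGA"


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def annealing_region(primer, site):
    """Part of the primer 3' of the engineered restriction site."""
    start = primer.index(site) + len(site)
    return primer[start:]


if __name__ == "__main__":
    region = annealing_region(CMV_CLONING_REVERSE, XBAI_SITE)
    sense = reverse_complement(region)
    print("CMV 3' end (sense strand):", sense)
    print("detection primer ends with it:", DETECTION_CMV_PRIMER.endswith(sense))
```

The last 20 bases of the detection primer match the sense-strand sequence at the 3' end of the amplified CMV fragment, so extension from that primer proceeds out of CMV toward the adjoining EPO sequence.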

In order to detect the targeting event, pools of clones
(from the electroporated 293 cells) from 4 wells each
(representing about 16 colonies) were generated by
trypsinizing wells and using 90% of each well for the
pool. The remaining 10% of each well was then reseeded
back into the well. Genomic DNA was then prepared from
each pool as follows. The cells in each pool were
pelleted by centrifugation for 2 min. in a 1.5 ml
microcentrifuge tube, resuspended in PBS (20 µl), and
treated for 1 hr at 37°C with a solution (400 µl)
containing 10 mM Tris-HCl (pH 7.5), 100 mM NaCl, 5 mM
EDTA, 1% SDS and RNase A (40 µg/ml). Proteinase K
(10 µl, 10 mg/ml) was then added, and the samples were
incubated for 4 hr at 50°C before extractions by
vigorous vortexing with phenol/chloroform (200 µl
each), then with chloroform (400 µl), the addition of
ethanol (800 µl), and centrifugation at 25°C for
10 min. The DNA pellets were washed with 70% ethanol,
dried and resuspended in TE (20 µl). An average of
40 µg of genomic DNA was obtained from each sample.
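
The average yield and resuspension volume quoted above imply a stock concentration, which in turn indicates how much stock carries a given mass of DNA. The following Python sketch works through that arithmetic; the function names are illustrative, and because the inputs are averages, individual samples would be expected to vary.

```python
# Worked arithmetic for the genomic DNA prep; values are the averages
# quoted in the text, so real samples will vary.

def stock_concentration(yield_ug=40.0, resuspension_ul=20.0):
    """Concentration of the resuspended DNA in ug/ul."""
    return yield_ug / resuspension_ul

def volume_for_mass(mass_ug, conc_ug_per_ul):
    """Volume (ul) of stock containing the requested DNA mass."""
    return mass_ug / conc_ug_per_ul

conc = stock_concentration()
print(f"stock: {conc:.1f} ug/ul")
print(f"volume holding 1 ug: {volume_for_mass(1.0, conc):.2f} ul")
```

On these averages the stock is about 2 µg/µl, so about 0.5 µl of stock carries 1 µg of genomic DNA.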

Approximately 1 µg from each sample of genomic DNA was
used for PCR analysis. The DNA in a volume (10 µl) of
TE was boiled for 10 min. prior to the addition of PCR
mix (40 µl). The reaction (50 µl) contained 10 mM
Tris-HCl (pH 9.0 at 25°C), 50 mM KCl, 1.5 mM MgCl2,
0.01% gelatin, 0.1% Triton X-100, 200 µM dNTPs, 1 µM
each of the primers 5'-AAGCAGAGCTCGTTTAGTGAACCG-3' and
5'-TGAGCGTGAGTTCTGTGGAATGTG-3', and 1.5 U of Taq DNA
polymerase (Promega). Following an initial incubation
of 94°C for 3 min, the samples were subjected to 45
cycles of denaturation at 94°C for 1 min., annealing at
66°C for 1.5 min. and extension at 72°C for 2 min. At
the end of the 45 cycles, the samples were incubated an
additional 5 min. at 72°C. A portion (20 µl) of each
sample was analyzed on a 1% agarose gel run in TBE and
stained with ethidium bromide. Out of the 90 pools
analyzed from 3 electroporations, two samples were
identified which exhibited the correct size fragment by
ethidium bromide staining. The DNA from the PCR
reaction was recovered and subjected to restriction
mapping with XbaI. The correct amplification product
should upon treatment with XbaI yield two fragments,
669 bp and 191 bp. The samples from the two pools both
yield fragments of the correct sizes. In addition, the
sample from pool 1 exhibits other bands in the uncut
material.
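
Two of the figures in this paragraph lend themselves to a quick consistency check: the total programmed time of the thermal profile, and whether the two XbaI fragments account for the full 860 bp product. The Python sketch below uses only the numbers quoted in the text; ignoring ramp time between temperatures is an assumption, and the names are illustrative.

```python
# Sanity checks on the PCR screen using only numbers quoted in the text.
# Ramp times between temperatures are ignored (an assumption).

INITIAL_DENATURATION_MIN = 3.0
CYCLES = 45
CYCLE_STEPS_MIN = {"denature 94C": 1.0, "anneal 66C": 1.5, "extend 72C": 2.0}
FINAL_EXTENSION_MIN = 5.0

def program_minutes():
    """Total programmed hold time of the thermal profile, in minutes."""
    per_cycle = sum(CYCLE_STEPS_MIN.values())
    return INITIAL_DENATURATION_MIN + CYCLES * per_cycle + FINAL_EXTENSION_MIN

def digest_matches(product_bp, fragments_bp):
    """True if the restriction fragments add up to the uncut product."""
    return sum(fragments_bp) == product_bp

print(f"programmed time: {program_minutes()} min")
print("XbaI fragments consistent with 860 bp product:",
      digest_matches(860, [669, 191]))
```

The profile amounts to 210.5 min of programmed holds, and 669 bp plus 191 bp equals the 860 bp product, as expected for a single XbaI site within the amplified region.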

Following the procedure described previously, metaphase
chromosomes are prepared from the recipients
demonstrating homologous recombination with DHFR and
transformed in DHFR deficient CHO cells. After
isolating resistant colonies and analyzing for
expression of EPO, the cell clones are grown in
selective medium containing progressively higher
concentrations of methotrexate (0.02-80 µM) with steps
of 4-fold increases in concentration. The cells are
then harvested, cloned and screened for production of
EPO. Clones providing for at least 2-fold enhancement
of EPO production are isolated.
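
The stepwise selection can be written out as an explicit concentration series. The sketch below is a minimal illustration assuming that each step is exactly 4-fold and that the final step is capped at the stated 80 µM maximum; the capping rule and the function name are assumptions, not details given in the text.

```python
# Sketch of a 4-fold methotrexate step series from 0.02 to 80 uM.
# Capping the last step at the stated maximum is an assumption.

def mtx_steps(start_um=0.02, stop_um=80.0, fold=4):
    """Return the stepwise concentrations in uM, ending at stop_um."""
    steps = []
    conc = start_um
    while conc < stop_um:
        steps.append(round(conc, 2))
        conc *= fold
    steps.append(stop_um)
    return steps

print(mtx_steps())
```

Six exact 4-fold steps from 0.02 µM would reach 81.92 µM, so the stated range corresponds to about six increases, with the last step falling marginally short of a full 4-fold change.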

It is evident from the above results that the subject
method provides for a novel approach to expression of a
wide variety of mammalian genes of interest. The method
is simple, only requires the knowledge of a sequence of
about 300 bp or more in the region of a target gene,
and one may then use substantially conventional
techniques for transferring the amplifiable region to
an expression host, and production of the desired
product in high yield.

All publications and patent applications cited in this
specification are herein incorporated by reference as
if each individual publication or patent application
were specifically and individually indicated to be
incorporated by reference.

Although the foregoing invention has been described in
some detail by way of illustration and example for
purposes of clarity of understanding, it will be
readily apparent to those of ordinary skill in the art
in light of the teachings of this invention that
certain changes and modifications may be made thereto
without departing from the spirit or scope of the
appended claims.

WHAT IS CLAIMED IS:

1. A method for producing mammalian proteins
comprising:

growing mammalian secondary expression host cells
comprising multiple copies of an amplifiable region
comprising a target gene heterologous to said secondary
expression host and expressing a protein of interest
and an amplifiable gene, whereby said target gene is
expressed and said protein is produced;

wherein said secondary host expression cells are
produced by the method comprising:

transforming primary mammalian cells comprising said
target gene with a construct comprising an amplifiable
gene and at least one flanking region of a total of at
least about 150 bp homologous with a DNA sequence at
the locus of the coding region of said target gene to
provide amplification of said target gene, wherein said
amplifiable gene is at a site which does not interfere
with the expression of said target gene, whereby said
construct becomes homologously integrated into the
genome of said primary cells to define an amplifiable
region;

selecting for primary cells comprising said construct
by means of said amplifiable gene or other marker
present in said construct;

isolating DNA portions of said genome from said primary
cells, wherein said portions are large enough to
include all of said amplifiable region;

transforming secondary expression host cells with said
primary cell DNA portions and cloning said transformed
secondary expression host cells to produce clones of
said secondary expression host cells differing in said
DNA portions present in said secondary expression host
cells;

selecting clones of said mammalian secondary expression
host cells comprising said amplifiable region; and

amplifying said amplifiable region by means of an
amplifying agent, wherein said amplifying is prior to
said isolating or after said selecting and prior to
said growing.

2. A method according to Claim 1, wherein said
amplifiable gene is a mammalian DHFR gene.

3. A method according to Claim 1, wherein said portions
are metaphase chromosomes.

4. A method according to Claim 1, wherein said portions
are restriction fragments.

5. A method according to Claim 1, wherein said primary
cells are human cells.

6. A method according to Claim 5, wherein said human
cells are fibroblast cells.

7. A method according to Claim 1, wherein said
construct comprises a biocidal marker providing
resistance to a biocide for said primary host cells.

8. A method for producing mammalian proteins
comprising:

transforming mammalian primary mammalian cells
comprising said target gene with a construct comprising
an amplifiable gene and at least one flanking region of
at least about 150 bp homologous with a DNA sequence
within 50 kb of the coding region of said target gene,
wherein said amplifiable gene is at a site which does
not interfere with the expression of said target gene,
whereby said construct becomes homologously integrated
into the genome of said primary cells to define an
amplifiable region comprising said amplifiable gene and
said target gene in said genome;

selecting for primary cells comprising said construct
by means of said amplifiable gene or other marker
present in said construct;

isolating DNA portions of said genome from said primary
cells, wherein said portions are large enough to
include all of said amplifiable region;

transforming mammalian secondary expression host cells
with said primary cell DNA portions, wherein said
secondary expression host cells are of a different
species from said primary host cells, and cloning said
transformed secondary expression host cells to produce
clones of said secondary expression host cells
differing in said DNA portions present in said
secondary expression host cells;

selecting clones of said mammalian secondary expression
host cells comprising said amplifiable region;

amplifying said amplifiable region by means of an
amplifying agent, wherein said amplifying is prior to
said isolating or after said selecting; and

growing said secondary expression host cells comprising
multiple copies of said amplifiable region, whereby
said target gene is expressed and said protein is
produced.

9. A method according to Claim 8, wherein said
amplifying is with said secondary expression host
cells.

10. A method according to Claim 8, wherein said primary
cells are human cells.

11. A method according to Claim 10, wherein said human
cells are diploid fibroblast cells.

12. A method according to Claim 8, wherein said
amplifiable gene is a mutated DHFR gene having a higher
Km than the wild-type gene.

13. A method according to Claim 12, wherein said
secondary host expression cell is DHFR deficient.

14. A method according to Claim 8, wherein said
construct further comprises a marker gene separated
from said amplifiable region by an homologous flanking
region.

15. A human cell comprising an amplifiable gene at
other than its wild-type site in the human genome and
within the locus of a target gene expressing a protein
to provide amplification of said target gene.

16. A human cell according to Claim 14, wherein said
cell is a normal cell.

17. A human cell according to Claim 14, wherein said
cell is a neoplastic cell.

18. A human cell according to Claim 14, wherein said
amplifiable gene is a DHFR gene.

19. A mammalian cell other than a human cell for
expression of mammalian proteins in culture comprising
an amplifiable region comprising an amplifiable gene
within 10 kb of a human wild-type gene expressing a
protein, wherein said two genes are separated by
substantially solely human wild-type sequence
associated with said target gene and the flanking
sequence associated with the amplifiable gene.

20. A method for producing cells for expression of a
heterologous protein in culture, said method
comprising:

transforming mammalian primary cells comprising said
target gene with a construct comprising an amplifiable
gene and at least one flanking region of at least about
150 bp homologous with a DNA sequence within 10 kb of
the coding region of said target gene, wherein said
amplifiable gene is at a site which does not interfere
with the expression of said target gene, whereby said
construct becomes homologously integrated into the
genome of said primary cells to define an amplifiable
region comprising said amplifiable gene and said target
gene in said genome;

selecting for primary cells comprising said construct
by means of said amplifiable gene or other marker
present in said construct;

isolating DNA portions of said genome from said primary
cells, wherein said portions are large enough to
include all of said amplifiable region;

transforming mammalian secondary expression host cells
with said primary cell DNA portions, wherein said
secondary expression host cells are of a different
species from said primary host cells, and cloning said
transformed secondary expression host cells to produce
clones of said secondary expression host cells
differing in said DNA portions present in said
secondary expression host cells;

selecting clones of said mammalian secondary expression
host cells comprising said amplifiable region; and
amplifying said amplifiable region by means of an
amplifying agent, wherein said amplifying is either
prior to said isolating or after said selecting.

21. A method according to Claim 20, wherein said
amplifying is with said secondary expression host
cells.

22. A method according to Claim 20, wherein said
primary cells are human cells.

23. A method according to Claim 22, wherein said human
cells are diploid fibroblast cells.

24. A method according to Claim 20, wherein said
amplifiable gene is a mutated DHFR gene having a higher
Km than the wild-type gene.

25. A method according to Claim 24, wherein said
secondary host expression cell is DHFR deficient.